Claims:



1. An apparatus for generating a hybrid speech model
associated to a given speech element, said hybrid
speech model being suitable for use by a speech
recognition device during speech recognition, said
apparatus comprising:

- an input for receiving an input signal derived
from a spoken utterance that contains at least
one speech element that potentially matches the
given speech element;

- a model group associated to the given speech
element, said model group comprising a
plurality of speech models, each speech model
of said plurality of speech models being a
different representation of the given speech
element;

- a processing unit coupled to the input for
processing the input signal and the model group
to generate a hybrid speech model associated to
the given speech element, said hybrid speech
model being a combination of speech models in
said plurality of speech models on the basis of
the input signal derived from the spoken
utterance;

- an output for releasing a signal indicative of
said hybrid speech model associated to the
given speech element in a format suitable for
use by a speech recognition device.

2. An apparatus as defined in claim 1, wherein the given
speech element is an element selected from the group 
consisting of phones, diphones, syllables and words. 

3. An apparatus as defined in claim 2, wherein said input 
signal derived from a spoken utterance is indicative of a 
speaker specific speech model associated to the at least 
one speech element. 

4. An apparatus as defined in claim 3, wherein said hybrid 
speech model is weighted toward the speaker specific 
speech model. 

5. An apparatus as defined in claim 4, wherein said hybrid 
speech model is derived by computing a linear combination 
of the speech models in said model group. 
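
The weighting and linear combination recited in claims 4 and 5 can be illustrated with a minimal sketch. It is an example under stated assumptions, not the claimed implementation: each speech model is assumed to be reduced to a plain feature vector, and the weights are assumed to come from a softmax over negative squared distances to the speaker specific model, which is one way of weighting the result toward that model. The names `hybrid_weights`, `combine` and `sharpness` are hypothetical; the claims do not say how the weights are obtained.

```python
import math
from typing import List, Sequence

Vector = List[float]


def hybrid_weights(speaker_model: Sequence[float],
                   model_group: Sequence[Sequence[float]],
                   sharpness: float = 1.0) -> List[float]:
    """Return one weight per model in the group; the weights sum to 1.

    Assumption: models closer (in squared Euclidean distance) to the
    speaker specific model receive larger weights.
    """
    dists = [sum((a - b) ** 2 for a, b in zip(speaker_model, m))
             for m in model_group]
    d_min = min(dists)  # shift by the minimum for numerical stability
    scores = [math.exp(-sharpness * (d - d_min)) for d in dists]
    total = sum(scores)
    return [s / total for s in scores]


def combine(models: Sequence[Sequence[float]],
            weights: Sequence[float]) -> Vector:
    """Linear combination of the models, one weight per model."""
    dim = len(models[0])
    return [sum(w * m[i] for w, m in zip(weights, models))
            for i in range(dim)]


# Hypothetical model group for one speech element (three representations).
model_group = [[0.0, 0.0], [2.0, 2.0], [10.0, 10.0]]
# Hypothetical speaker specific model derived from the input signal.
speaker_model = [1.9, 2.1]

weights = hybrid_weights(speaker_model, model_group)
hybrid = combine(model_group, weights)
```

In this sketch the second model of the group dominates the combination because it lies closest to the speaker specific model; a larger `sharpness` concentrates the weight further on the nearest model.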

6. An apparatus as defined in claim 5, wherein said input is 
a first input and wherein said input signal is a first 
input signal, said apparatus further comprising: 

a) a second input for receiving a second input 
signal conveying a data element identifying the 
given speech element; 

b) a database of model groups comprising a plurality 
of model groups, each model group being 
associated to a respective speech element, each 
model group comprising a set of speech models; 

said processing unit being further operative for
extracting from said database of model groups a
certain model group associated to the data element
received at said second input identifying the given
speech element.
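
The extraction step recited in claim 6 can be sketched as a keyed lookup. This is an illustration only: it assumes the database is an in-memory mapping from a speech element identifier to its model group, and the names `MODEL_GROUP_DB` and `extract_model_group`, the identifiers, and the numeric values are all hypothetical.

```python
from typing import Dict, List

ModelGroup = List[List[float]]

# Hypothetical database: one model group per speech element identifier.
MODEL_GROUP_DB: Dict[str, ModelGroup] = {
    "aa": [[0.0, 0.1], [0.2, 0.3]],
    "iy": [[1.0, 1.1], [1.2, 1.3], [1.4, 1.5]],
}


def extract_model_group(db: Dict[str, ModelGroup], element_id: str) -> ModelGroup:
    """Return the model group associated with the identified speech element."""
    try:
        return db[element_id]
    except KeyError:
        # Report an identifier with no stored model group explicitly.
        raise KeyError(
            f"no model group stored for speech element {element_id!r}"
        ) from None


# The data element received at the second input selects the group.
group = extract_model_group(MODEL_GROUP_DB, "iy")
```

A mapping keeps the lookup independent of how many model groups are stored; a persistent store could replace the dictionary without changing the calling code.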

7. A method for generating a hybrid speech model
associated to a given speech element, said hybrid
speech model being suitable for use by a speech
recognition device during speech recognition, said
method comprising:

- receiving an input signal derived from a spoken
utterance that contains at least one speech
element that potentially matches the given
speech element;

- providing a model group associated to the given
speech element, said model group comprising a
plurality of speech models, each speech model
of said plurality of speech models being a
different representation of the given speech
element;

- processing the input signal and the model group
to generate a hybrid speech model associated to
the given speech element, said hybrid speech
model being a combination of speech models in
said plurality of speech models on the basis of
the input signal derived from the spoken
utterance;

- releasing a signal indicative of said hybrid
speech model associated to the given speech
element in a format suitable for use by a
speech recognition device.

8. A computer readable medium comprising a program element
suitable for execution by a computing apparatus for
generating a hybrid speech model associated to a given
speech element, said hybrid speech model being suitable
for use by a speech recognition device during speech
recognition, said computing apparatus comprising:

- a memory unit for storing a model group associated
to the given speech element, said model group
comprising a plurality of speech models, each speech
model of said plurality of speech models being a
different representation of the given speech
element;

- a processor operatively connected to said memory
unit, said program element when executing on said
processor being operative for implementing:

- an input for receiving an input signal
derived from a spoken utterance that contains
at least one speech element that potentially
matches the given speech element;

- a processing unit coupled to the input for
processing the input signal and the model
group to generate a hybrid speech model
associated to the given speech element, said
hybrid speech model being a combination of
speech models in said plurality of speech
models on the basis of the input signal
derived from the spoken utterance;

- an output for releasing a signal indicative
of said hybrid speech model associated to the
given speech element in a format suitable for
use by a speech recognition device.

9. An apparatus for generating a complex speech model
associated to a given speech element, said complex
speech model being suitable for use by a speech
recognition device during speech recognition, said
apparatus comprising:

- an input for receiving an input signal derived
from a spoken utterance that contains at least
one speech element that potentially matches the
given speech element;

- a model group associated to the given speech
element, said model group comprising a
plurality of speech models, each speech model
of said plurality of speech models being a
different representation of the given speech
element, said model group comprising two sets
of speech models namely a first set having
speech models of a first type and a second set
having speech models of a second type, each
speech model of a first type in said first set
being associated to a speech model of the
second type in the second set;

- a processing unit coupled to the input for:

a) processing the input signal and the model
group to generate a hybrid speech model
associated to the given speech element,
said hybrid speech model being a
combination of speech models of the first
type in said plurality of speech models on
the basis of the input signal derived from
the spoken utterance;

b) processing the hybrid speech model to
generate a complex speech model associated
to the given speech element, said complex
speech model being a combination of speech
models of the second type in said
plurality of speech models;

- an output for releasing a signal indicative of
said complex speech model associated to the
given speech element in a format suitable for
use by a speech recognition device.

10. An apparatus as defined in claim 9, wherein any speech
model of a second type is indicative of a speech model
having a higher complexity than a speech model of a first
type to which it is associated.




11. An apparatus as defined in claim 10, wherein the given
speech element is indicative of a data element selected
from the set consisting of phones, diphones, syllables
and words.



12. An apparatus as defined in claim 11, wherein said input 
signal derived from a spoken utterance is indicative of a 
speaker specific speech model associated to the at least
one speech element.

13. An apparatus as defined in claim 12, wherein said
hybrid speech model is weighted toward the speaker
specific speech model.

14. An apparatus as defined in claim 13, wherein said
hybrid speech model is derived by computing a linear 
combination of the speech models of the first type. 

15. An apparatus as defined in claim 14, wherein said 
linear combination is a first linear combination and is 
characterized by a first set of parameters indicative of 
weights associated to speech models of the first type, 
said complex speech model being derived by computing a 
second linear combination of the speech models of the 
second type, said second linear combination being 
characterized by a second set of parameters indicative of 
weights associated to speech models of the second type. 

16. An apparatus as defined in claim 15, wherein said first 
set of parameters and said second set of parameters are 
indicative of substantially the same weights.
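
Claims 14 to 16 can be illustrated with a short sketch in which the weights of the first linear combination are reused for the second. It is an example under assumptions, not the claimed implementation: speech models are assumed to be plain vectors, the second type is assumed to differ from the first only in having more parameters, and the pairing between the two sets is assumed to be positional. The names `linear_combination` and `hybrid_and_complex` and all numeric values are hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def linear_combination(models: Sequence[Sequence[float]],
                       weights: Sequence[float]) -> Vector:
    """Weighted sum of the models, one weight per model."""
    if len(models) != len(weights):
        raise ValueError("one weight is required per speech model")
    dim = len(models[0])
    return [sum(w * m[i] for w, m in zip(weights, models))
            for i in range(dim)]


def hybrid_and_complex(first_type: Sequence[Sequence[float]],
                       second_type: Sequence[Sequence[float]],
                       weights: Sequence[float]) -> Tuple[Vector, Vector]:
    """Build the hybrid model from the first set, then apply the same
    weights to the associated models of the second set."""
    if len(first_type) != len(second_type):
        raise ValueError("each first-type model needs a second-type partner")
    hybrid = linear_combination(first_type, weights)
    complex_model = linear_combination(second_type, weights)
    return hybrid, complex_model


# Hypothetical model group: model i of the first set is associated
# with model i of the second set.
first_type = [[0.0], [4.0]]
second_type = [[0.0, 0.0, 0.0], [4.0, 8.0, 12.0]]

hybrid, complex_model = hybrid_and_complex(first_type, second_type, [0.25, 0.75])
```

Reusing the weights means they only need to be estimated once, on the simpler first-type models, before being applied to the higher-complexity second-type models.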

17. An apparatus as defined in claim 10, wherein said input 
is a first input and wherein said input signal is a first 
input signal, said apparatus further comprising: 

a) a second input for receiving a second input 
signal indicative of a data element identifying 
the given speech element; 

b) a database of model groups comprising a plurality 
of model groups, each model group being 
associated to a respective speech element, each 
model group comprising two sets of speech models 
namely a first set having speech models of a 
first type and a second set having speech models 
of a second type, each speech model of a first
type in said first set being associated to a
speech model of the second type in the second
set;

said processing unit being further operative for 
extracting from said database of model groups a 
certain model group associated to the data element 
received at said second input identifying the given 
speech element. 

18. A computer readable medium comprising a data structure
for storing a plurality of model groups, each model
group being associated to a respective speech element
in a phonetic alphabet, each model group comprising a
plurality of speech models, each model group being
suitable for use by a processing device.

19. A computer readable medium as defined in claim 18,
wherein the given speech element is indicative of a data
element selected from the set consisting of phones,
diphones, syllables and words.



20. A computer readable medium as defined in claim 19,
wherein each model group comprises two sets of speech
models namely a first set having a plurality of speech
models of a first type and a second set having a
plurality of speech models of a second type, each
speech model of a first type in said first set being
associated to a speech model of the second type in the
second set, each speech model of the second type being
indicative of a speech model having a higher complexity
than a speech model of the first type to which the
speech model of the second type is associated.

21. A speech recognition system comprising:

- an input for receiving an input signal
indicative of a spoken utterance that is
indicative of at least one speech element;

- a first processing unit coupled to said input
operative for processing the input signal to
derive from a speech recognition dictionary at
least one speech model associated to a given
speech element that constitutes a potential
match to the at least one speech element;

- a second processing unit coupled to said first
processing unit for generating a modified
version of the at least one speech model on the
basis of the input signal;

- a third processing unit coupled to said second
processing unit for processing the input signal
on the basis of the modified version of the at
least one speech model to generate a
recognition result indicative of whether the
modified version of the at least one speech
model constitutes a match to the input signal;

- an output for releasing a signal indicative of
the recognition result.

22. A speech recognition system as defined in claim 21,
wherein said first processing unit is operative for
generating a speaker specific speech model derived on
the basis of the input signal, the speaker specific
speech model being indicative of the acoustic
characteristics of the at least one speech element.

23. A speech recognition system as defined in claim 22, 
wherein said modified version of the at least one 
speech model is indicative of a hybrid speech model 
associated to the given speech element. 



24. A speech recognition system as defined in claim 23, 
wherein said second processing unit comprises:

- a coupling member for allowing data exchange
between the first processing unit and the
second processing unit, said coupling member
being suitable for receiving the speaker
specific speech model derived from the input
signal;

- a model group associated to the given speech
element, said model group comprising a
plurality of speech models, each speech model
of said plurality of speech models being a
different representation of the given speech
element;

- a functional unit coupled to the coupling
member for processing the speaker specific
speech model and the model group to generate
the hybrid speech model associated to the given
speech element, said hybrid speech model being
a combination of speech models in said
plurality of speech models on the basis of the
speaker specific speech model;

- an output coupling member for allowing data
exchange between the second processing unit and 
the third processing unit, said output coupling 
member being suitable for releasing a signal 
indicative of the hybrid speech model 
associated to the given speech element. 

25. A speech recognition system as defined in claim 24, 
wherein said hybrid speech model is weighted toward the 
speaker specific speech model. 

26. A speech recognition system as defined in claim 24, 
wherein said hybrid speech model is derived by computing 
a linear combination of the speech models in said group 
of speech models. 

27. A speech recognition system as defined in claim 24, 
wherein said coupling member is a first coupling member, 
said second processing unit further comprising: 

a) a second coupling member for allowing data 
exchange between the first processing unit and 
the second processing unit, said second coupling 
member being suitable for receiving a data 
element identifying the given speech element; 

b) a database of model groups comprising a plurality 
of model groups, each model group being 
associated to a respective speech element, each 
model group comprising a set of speech models; 

said functional unit being further operative for
extracting from said database of model groups a
certain model group associated to the data element
received at said second coupling member identifying
the given speech element.

28. A speech recognition system as defined in claim 22, 
wherein said modified version of the at least one 
speech model is indicative of a complex speech model 
associated to the given speech element. 

29. A speech recognition system as defined in claim 28, 
wherein said second processing unit comprises: 

- a coupling member for allowing data exchange
between the first processing unit and the
second processing unit, said coupling member
being suitable for receiving the speaker
specific speech model derived from the input
signal;

- a model group associated to the given speech 
element, said model group comprising a 
plurality of speech models, each speech model 
of said plurality of speech models being a 
different representation of the given speech 
element, said model group comprising two sets 
of speech models namely a first set having 
speech models of a first type and a second set 
having speech models of a second type, each 
speech model of a first type in said first set 
being associated to a speech model of the 
second type in the second set; 

- a functional unit coupled to the coupling
member for:

a) processing the speaker specific
speech model and the model group to 
generate a hybrid speech model 
associated to the given speech 
element, said hybrid speech model 
being a combination of speech models 
of the first type in said plurality 
of speech models on the basis of the 
speaker specific speech model; 

b) processing the hybrid speech model to 
generate a complex speech model 
associated to the given speech 
element, said complex speech model 
being a combination of speech models 
of the second type in said plurality 
of speech models; 

- an output coupling member for allowing data 
exchange between the second processing unit and 
the third processing unit, said coupling member 
being suitable for releasing a signal 
indicative of the complex speech model 
associated to the given speech element. 

30. A speech recognition system as defined in claim 29,
wherein any speech model of the second type is indicative
of a speech model having a higher complexity than a
speech model of the first type to which it is associated.

31. A speech recognition system as defined in claim 30, 
wherein said hybrid speech model is weighted toward the 
speaker specific speech model. 




32. A speech recognition system as defined in claim 31, 
wherein said hybrid speech model is derived by computing 
a linear combination of the speech models of the first
type.

33. A speech recognition system as defined in claim 32,
wherein said linear combination is a first linear
combination and is characterized by a first set of
parameters indicative of weights associated to speech
models of the first type, said complex speech model being
derived by computing a second linear combination of the
speech models of the second type, said second linear
combination being characterized by a second set of
parameters indicative of weights associated to speech
models of the second type.

34. A speech recognition system as defined in claim 33,
wherein said first set of parameters and said second set
of parameters are indicative of substantially the same
weights.

35. An apparatus for generating a hybrid speech model
associated to a given speech element, said hybrid
speech model being suitable for use by a speech
recognition device during speech recognition, said
apparatus comprising:

- means for receiving an input signal derived
from a spoken utterance that contains at least
one speech element that potentially matches the
given speech element;

- a model group associated to the given speech
element, said model group comprising a
plurality of speech models, each speech model
of said plurality of speech models being a
different representation of the given speech
element;

- means for processing the input signal and the
model group to generate a hybrid speech model
associated to the given speech element, said
hybrid speech model being a combination of
speech models in said plurality of speech
models on the basis of the input signal derived
from the spoken utterance;

- means for releasing a signal indicative of said
hybrid speech model associated to the given
speech element in a format suitable for use by
a speech recognition device.



36. A speech recognition system comprising:

- means for receiving an input signal indicative of a
spoken utterance that is indicative of at least
one speech element;

- means for processing the input signal to derive
from a speech recognition dictionary at least
one speech model associated to a given speech
element that constitutes a potential match to
the at least one speech element;

- means for generating a modified version of the
at least one speech model on the basis of the
input signal;